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AUTOMATED CONTENT INTEGRITY VALIDATION 
FOR STREAMING DATA 

FIELD OF THE INVENTION 

[0001] The present invention relates to the field of electronic content distribution. 
More particularly, the present invention relates to the field of streaming media content 
distribution. 

BACKGROUND OF THE INVENTION 

[0002] Streaming is a technique for transferring data such that it can be processed as a 
steady and continuous stream. Many real-time events, such as live news items, have to 
be streaming events or else they may become much less relevant and less valuable. 
Also, streaming allows a personal computer (PC) user connected to the hitemet to play 
a multimedia file, such as an audio or video file, in real time without having to wait for 
the entire file to be downloaded. This is important because many users do not have 
sufficiently fast access to the Internet to download a large multimedia file in an 
adequately short amount of time. Typically, data streams are generated and distributed 
by streaming content providers (SCPs). 

[0003] An SCP may map a data stream to a Uniform Resource Locator (URL) to 
provide users access to the data stream through readily available player programs which 
are commonly integrated with Internet browser programs. To begin a viewing or 
listening session, a user may direct a player program or browser program to the URL of 
a desired data stream. However, a user may encounter unexpected interruptions during 
playback caused by drop-outs in dehvery. Drop-outs may be caused by network 
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delivery problems introduced through various elements in distribution logic used to 
deliver the data stream from its source to the end user. 

[0004] An SCP may provide streaming content generation and delivery services to 
resellers of streaming content, such as Internet radio stations and Intemet service 
providers (ISPs), who provide end users access to the streaming content. It is common 
for a service level agreement (SLA) between an SCP and a reseller to include a 
provision that commits the SCP to a specified level of content delivery accuracy, as 
well as penalty provisions if the specified level of accuracy is not achieved. 

[0005] In today's Intemet content delivery systems, there are no mechanisms to ensure 
that integrity of streaming content is preserved from its source to its destination, for 
example, an end user. Therefore, in an attempt to assure accuracy, an SCP may employ 
human operators to direct a player program to the URL of a data stream in order to 
validate that the content is sfreamed correctly. To employ human operators for content 
integrity validation (CIV) is expensive and directly impacts scalability of operations 
due to the limited number of data streams a human operator can validate. Further, 
because the process involves hviman interaction, the possibility of human error exists. 
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BRIEF DESCRIPTION OF THE DRAWINGS 



[0006] The present invention is illustrated by way of example and not limitation in the 
figures of the accompanying drawings, in which like references indicate similar 
elements, and in which: 

[0007] Figure 1 shows a global view of an exemplary streaming content delivery 
network (SCDN) capable of performing automated content integrity validation (CIV) 
according to the present invention. 

[0008] Figure 2 shows a client-server view of a SCDN according to one embodiment of 
the present invention. 

[0009] Figure 3 shows another client-server view of a SCDN according to one 
embodiment of the present invention. 

[0010] Figure 4 shows a flow diagram illustrating, for one embodiment, the operation 
of CIV according to the present invention. 

[0011] Figure 5 shows an exemplary data packet containing a fingerprint block and 
sampling parameters. 

[0012] Figure 6 shows a flow diagram illustrating, for one embodiment, the operation 
of CIV to accommodate a level of loss in streaming content delivery. 
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DETAILED DESCRIPTION 



[0013] The following detailed description sets forth an embodiment or embodiments in 
accordance with the present invention. In the following description, numerous details 
are set forth. It will be apparent, however, to one skilled in the art, that the present 
invention may be practiced without these specific details. Iti other instances, well- 
known structures and devices are shown in block diagram form, rather than in detail, in 
order to avoid obscuring the present invention. 

AN EXEMPLARY STREAMING CONTENT DELIVERY NETWORK 

[0014] Figure 1 illustrates a global view of an exemplary streaming content delivery 
network (SCDN) 100 of a streaming content provider (SCP) capable of performing 
automated content integrity validation (CIV), according to one embodiment of the 
present invention. As illustrated, an SCDN may comprise a server 102, clients, such as 
client 104 and distribution logic 106. The server may comprise any suitable computer 
system and equipment to generate and dehver streaming media content to other 
computer systems, such as the client, through the distribution logic. The client may 
comprise any suitable computer system to receive streaming media content from the 
server through the distribution logic. For one embodiment, the client may be a personal 
computer (PC) of an end user connected to the Intemet. 

[0015] The distribution logic may comprise a public network such as the Intemet, a 
private network such as a corporate network, or any combination of suitable pubhc and 
private networks. The distribution logic may also comprise last mile technologies used 
by an end user to connect to the Intemet, such as a modem or digital subscriber line 
(DSL). As illustrated, for one embodiment, distribution logic may comprise a global 
load balancer 1 12, splitter pools with local load balancers, such as splitter pool 1 10, and 
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the public Internet. Global load balancers, local load balancers, splitters, and other 
elements in the distribution logic create additional switching points between the server 
and client that may introduce network delivery problems resulting in lost packets. 
Problems with last mile technologies used by a client to coimect to the Internet, such as 
poor quality telephone lines, may also result in lost packets. 

[0016] To detect lost packets, an SCP may perform automated CIV. For one 

embodiment, the server may sample a data stream with a fingerprint block generator 
(FBG) 120 to generate one or more fingerprint blocks for the data stream. The server 
may send the one or more fingerprint blocks generated at the server to a client 
requesting the data stream. The chent, downstream of the distribution logic, may 
sample the data stream with a fingerprint block validator (FBV) 122 to generate one or 
more fingerprint blocks for the data stream at the client. The client may compare one 
or more fingerprint blocks generated at the client to one or more fingerprint blocks 
generated at the server. Missing or mismatched fingerprint blocks may indicate lost 
packets due to possible network delivery problems. 

[00171 Figure 2 illustrates a client-server view of SCDN 100 that provides further 
detail of server 102 and client 104, according to one embodiment of the present 
invention. As illustrated, the server may comprise an encoder 206, a fingerprint block 
generator (FBG) 120, and a packetizer 210. The encoder, the FBG 120, and the 
packetizer 210 may be implemented in software or hardware. For example, as 
illustrated in Figure 3, the server may comprise a processor 302 and a memory 304, to 
store data and instructions for execution by processor 302 to implement the encoder 
206, the FBG 120, and the packetizer 210. 
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[0018] The encoder may encode a media signal to create a data stream of an 
appropriate format. For one embodiment, encoding may comprise sampling the media 
signal and compressing the resulting information so that it can efficiently be sent over 
the Litemet. As an example, because many users connect to the Intemet using 28.8 
kilobits per second (kbps) modems, the server may comprise an encoder that 
compresses an audio signal into a 20 kbps audio data stream. 

[0019] The FBG may sample the data stream created by the encoder to generate a 
fingerprint block for the sampled portion of the data stream. The FBG may generate a 
fingerprint block according to any suitable method, for example, that may generate a 
different fingerprint block if a single bit of data in the sampled portion is changed. For 
one embodiment, the FBG may generate a fingerprint block by sampling the data 
stream and generating a cyclic redundancy check (CRC) on a sampled portion. CRC 
algorithms are well known in the art, and any suitable form of CRC algorithm may be 
used to generate fingerprint blocks for a data stream. For example, fingerprint blocks 
maybe 16-bit (CRC-16) or 32-bit (CRC-32) values, generated using any suitable CRC 
polynomial. CRC algorithms may be implemented in hardware or software. For one 
embodiment, the FBG may sample portions of a data stream for predetermined periodic 
intervals and generate fingerprint blocks for the sampled portions. As an example, the 
FBG may periodically sample a predetermined length of the data stream that is 
equivalent to a predetermined amount of playback time. 

[0020] As illustrated in Figures 2 and 3, encoded data created by the encoder may be 

fed to the packetizer. For one embodiment, the packetizer may create packets of 
streaming data by appropriately iragmenting a continuous encoded bit stream from the 
encoder such that the data packet size fits within network imposed limits. The 
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packetizer may also place fingerprint blocks for a data stream generated by the FBG in 
data packets to be sent to a client requesting the data stream. 

[0021] It may be important for a client requesting the data stream to have the ability to 
sample the data stream in the same manner as the server in order to generate fingerprint 
blocks for the same sampled portion. Therefore, for one embodiment, the FBG may 
output fingerprint blocks to the packetizer, along with sampling parraieters used to 
generate the fingerprint blocks. The packetizer may place the sampling parameters, 
along with the fingerprint blocks in data packets to be sent to a client requesting the 
data stream. The client requesting the data stream may use the sampling parameters to 
sample the data stream in the same manner as the server, and may generate a fingerprint 
block for the same sampled portion as the server. 

[0022] As illustrated in Figures 2 and 3, a chent may comprise a processor 212 and a 

memory 214, to store data and instructions for execution by processor 212. For one 
embodiment, a memory may have stored therein a set of instructions to implement a 
fmgerprint block vaHdator (FBV) 122 and a player program 218. 

[0023] The player program may be a proprietary or readily available player program 
capable of playing streammg content. Examples of readily available player programs 
include Windows Media ™ Player available fi-om Microsoft Corporation and 
RealPlayer ® available firom RealNetworks, Inc. The player program may be 
integrated into an Internet browser program as a plug-in application that is loaded when 
the browser program detects an incommg data stream of an appropriate format. 
Examples of the Intemet browser program include Internet Explorer available from 
Microsoft Corporation and Netscape Navigator ® available from Netscape 
Communications Corporation. 
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[0024] The FBV may be a plug-in module integrated with the player program. The 
FBV may sample the data stream and generate fingerprint blocks for the sampled 
portion. For one embodiment, the FBV may receive fingerprint blocks generated at the 
server. As previously described, the FBV may also receive sampling parameters fi-om 
the server. For one embodiment, the FBV compares fingerprint blocks generated at the 
client to fingerprint blocks generated at the server. 

[0025] As illustrated, while the server and the cHent may comprise memory, they may 
comprise any machine-readable medium, including any mechanism that provides 
information in a form readable by a machine, such as a computer. For example, a 
machine-readable medium may include read only memory (ROM), random access 
memory (RAM), magnetic disk storage media, optical storage media, flash memory 
devices, and electrical, optical, acoustical, or other forms of propagated signals. 

AUTOMATED CONTENT INTEGRITY VALIDATION 

[0026] Flow diagram 400, of Figiire 4, illustrates the operation of automated content 
integrity validation (CIV) according to one embodiment of the present invention. For 
block 402, the client requests a data stream from the server. For example, the request 
may result firom an end user directing a player program to a URL of the data stream. 
For block 404, the server samples the data stream and generates one or more fingerprint 
blocks for one or more sampled portions of the data stream at the server. For block 
406, the server sends the data stream to the client. 

[0027] For block 408, the client obtains one or more fingerprint blocks generated at the 
server. For one embodiment, one or more fingerprint blocks generated at the server are 
sent to the client through the distribution logic, in response to the request for the data 
stream. For one embodiment, the server may send one or more fingerprint blocks to the 
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client through a first connection between the server and the client while sending the 
data stream to the client through a second connection between the server and the client. 
For example, the server may send the data stream to the client through a primary data 
connection while sending fingerprint blocks to the client through an out-of-band 
connection. 

[0028] For one embodiment, the client may obtain one or more fingerprint blocks prior 
to requesting the data stream. This may be possible if the requested data stream is 
delivered on-demand, such as a data stream for a pre-recorded song. Because the 
content of a data stream delivered on-demand may be predetermined, fingerprint blocks 
may be generated prior to receiving a request for the data stream. 

[0029] For one embodiment, the client may receive a formatted electronic mail (e-mail) 
message containing fingerprint blocks generated for a data stream delivered on- 
demand, download the fingerprint blocks fi-om a website, or install them as an 
automatic update for the player program. Fingerprint blocks obtained prior to 
requesting a data stream may be stored in a database on the client. It should be noted 
that fingerprint blocks may not be generated ahead of time for Hve data streams because 
the content of a live data stream is not predetermined. 

[0030] For block 410, the client receives the data stream. For block 412, the chent 

samples the data stream and generates one or more fingerprint blocks for one or more 
sampled portions of the data stream at the client. For one embodiment, the chent may 
sample the data stream in accordance with sampling parameters obtained firom the 
server. As previously described, the server may send the sampling parameters to the 
cUent in data packets containing fingerprint blocks. 
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[0031] Figure 5 illustrates, for one embodiment, a portion of an exemplary data packet 
500 having a fingerprint block and sampling timestamps defining the portion of the 
data stream sampled to generate the fingerprint block. As illustrated, the first field may 
contain a beginning timestamp, the second field may contain an endmg timestamp, and 
the third field may contain a fmgerprint block, for example a 32-bit CRC value. As an 
example, a timestamp may have the following format: 

hours .-minutes : seconds : frames, subframes 
to indicate a precise point in the data stream. Therefore, a client receiving a data stream 
may sample the data stream firom the point indicated by the beginning timestamp to the 
point indicated by the ending timestamp, and generate a fingerprint block for the 
sampled portion. 

[0032] Referring back to Figure 4, for block 414, the cUent compares one or more 
fingerprint blocks generated at the client to one or more fingerprint blocks generated at 
the server. If the compared fingerprint blocks match, the integrity of the content of the 
data stream is validated, and the YES branch of block 416 is taken. 
[0033] For block 418, the cHent communicates a valid status message to the server and 
continues processing the data stream. The server may, for example, monitor a 
connection between the server and the chent for a vaUd status message after sending a 
data stream fi-om the server to the chent. Absence of the vahd status message fi-om the 
client may indicate a network delivery error. For one embodiment, the server may 
generate an alarm message if a predetermined amount of time passes without receiving 
a vahd status message from the client to notify appropriate personnel of a potential 
network delivery error. For one embodiment, the server may continue to sample the 
data stream and generate fingerprint blocks for sampled portions of the data stream 
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while sending the data stream to the client. Therefore, steps 404 through 416 may be 
repeated while the client receives liie data stream. 

[0034] If one or more fingerprint blocks generated at the client do not match one or 
more fingerprint blocks generated at the server, content integrity of the data stream is 
invalidated and the NO branch of block 416 is taken. Mismatched fingerprint blocks 
may indicate a network delivery error. 

[0035] Therefore, for block 420, an error message is generated at the client, to indicate 
that a Crv error has occurred. The error message may allow a user to retry, for 
example, to request the data stream fi:om the server again, or exit without retrying. For 
one embodiment, the FBV may generate an error message if a requested data stream is 
not received in a predetermined maximum time period. For one embodiment, a player 
program may optionally disable CIV in order to play data streams delivered from 
servers that do not generate fingerprint blocks for delivered data streams. 
[0036] For block 422, the client communicates an error message to the server to 
indicate that a CIV error has occurred. When a CTV error is received by the server, for 
example, appropriate personnel may be notified of a possible network delivery error. 
For one embodiment, diagnostic information may be included with the error message, 
such as the URL of the requested data stream and fingerprint blocks generated, or 
received for the data stream. Such diagnostic information may assist appropriate 
personnel in determining the source of the error, for example, the element in the 
distribution logic that may have caused the loss of content integrity. 
[0037] According to the method illustrated by flow diagram 400, a single lost packet 
may cause mismatched fingerprint blocks, which may result in invalidation of content 
integrity for a data stream. However, for one embodiment, a method of automated 
content integrity validation may accommodate some number of lost packets in a 
streaming content delivery system. 
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[0038] Flow diagram 600, of Figure 6, illustrates the operation of a method to perform 
content integrity validation that allows validation of content integrity after a number of 
mismatched fingerprint blocks. Flow diagram 400 and flow diagram 600 comprise the 
same operations through block 614, where the chent compares one or more fingerprint 
blocks generated at the client to one or more fingerprint blocks generated at the server. 
However, for block 616, if a threshold percentage of the compared fingerprint blocks 
match, the YES path of block 616 is taken. If a threshold percentage of the compared 
fingerprint blocks do not match, content integrity of the data stream is invaUdated and 
the NO branch of block 616 is taken. 

[0039] For one embodiment, a percentage of fingerprint block matches may be 
calculated by dividing a number of fingerprint blocks generated at the client that match 
fingerprint blocks generated at the server by a total number of fingerprint blocks 
generated at the client. For example, if each fingerprint block generated at the cUent 
matches a fingerprint block generated at the server, the percentage may be one hundred. 
Therefore, by estabhshing a threshold percentage below one hundred, the method 
illustrated in flow diagram 600 may accommodate a number of lost packets that result 
in mismatched fingerprint blocks. For one embodiment, a threshold percentage may be 
adjusted to accommodate connections with varying levels of loss. For another 
embodiment, a threshold number, rather than a threshold percentage, may be 
established so that if the threshold number of fingerprint blocks generated at the server 
do not match fingerprint blocks generated at the server, content integrity of the data 
stream is invalidated. 

[0040] As previously described, the client may establish a primary data connection 
with the server to receive the data stream, as well as an out-of-band connection with the 
server. The out-of-band connection may be used to communicate CIV status messages 
to the server while the client is receivmg the data stream through the primary data 
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connection. The out-of-band connection may enable the server to take appropriate 
action to correct a CIV error and monitor the out-of-band connection to determine if the 
error is still present. 

[0041] For one embodiment, a log file may be created and updated based on CIV 
results. The log file may be created at the server or the client. For example, the log file 
may contain data indicative of the number of successful and unsuccessful validations 
that occurred for a particular data stream, along with associated time stamps. This data 
may be used to determine a measure of content delivery accuracy. Such a measure may 
be useful in determining if a level of content delivery accuracy specified in a service 
level agreement (SLA) between a streaming content provider (SCP) and a reseller has 
been achieved. For one embodiment, a log file generated at the client may be uploaded 
periodically by the server. 

[0042] For one embodiment, specific versions of fingerprint block validators (FBVs) 
may integrate with existing, readily available player programs, as plug-in modules, to 
enable the player programs to perform automated CIV. For example, one plug-in 
module may integrate with Windows Media ™ Player player program, while another 
plug-in module may integrate with RealPlayer ® player program. 
[0043] For one embodiment, an automated CW system may be developed to provide 
an SCP with information regarding its delivery network. For example, referring back 
to Figure 1, an SCP may estabhsh a plurality of clients downsti-eam of the distribution 
logic used to deliver st-eaming content from the server to the clients. The cHents may 
be physically located to access sti-eaming content tiu-ough different paths in the 
distiibution logic, for example, through different splitter pools. Special player 
programs may be developed and utilized at the clients, for example, to periodically 
access streaming content provided by the SCP through a list of URLs, and log the 
results as previously described. Diagnostic information received at the server from the 
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different clients may provide the SCP with greater assurance that the integrity of 
streaming content is preserved through different paths in the distribution logic, or may 
indicate elements in the distribution logic that may have perturbed the integrity of the 
streaming content. 

[0044] In the foregoing description, the invention has been described with reference to 
specific exemplary embodiments thereof. It will, however, be evident that various 
modifications and changes may be made thereto without departing from the broader 
spirit or scope of the present invention as defined in the appended claims. The 
specification and drawings are, accordingly, to be reg^ded in an illustrative rather than 
a restrictive sense. 
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